Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding

نویسندگان

  • Lizhong Wu
  • Mahesan Niranjan
  • Frank Fallside
چکیده

I Recent studies have shown that non-linear prediction can be implemented with neural networks, and non-linear predictors will on average achieve about 2 3 improvement in prediction gain over conventional linear predictors. In this paper, we take the advantage of non-linear prediction with neural network, apply it to predictive speech coding and attempt to improve the speech coding performance. Our studies are concentrated on non-linear prediction with neural network, non-linear predictive vector quantiser, non-linear long term (pitch) speech prediction, the output variance of the non-linear predictive synthesis lter to its input distortion and fully vector quantised code-excited non-linear predictive speech coding. The above studies have resulted in a fully vector quantised CENN-excited non-linear predictive speech coder. Performance evaluations and comparisons with the linear predictive speech coding are made in this paper.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A mixed sinusoidally excited linear prediction coder at 4 kb/s and below

There is currently a great deal of interest in the development of speech coding algorithms capable of delivering toll quality at 4 kb/s and below. For synthesizing high quality speech, accurate representation of the voiced portions of speech is essential. For bit rates of 4 kb/s and below, conventional Code Excited Linear Prediction (CELP) may likely not provide the appropriate degree of period...

متن کامل

A Robust Frame-based Nonlinear Prediction System for Automatic Speech Coding

In this paper, we propose a neural-based coding scheme in which an artificial neural network is exploited to automatically compress and decompress speech signals by a trainable approach. Having a two-stage training phase, the system can be fully specified to each speech frame and have robust performance across different speakers and wide range of spoken utterances. Indeed, Frame-based nonlinear...

متن کامل

Speaker dependent mapping for low bit rate coding of throat microphone speech

Throat microphones (TM) which are robust to background noise can be used in environments with high levels of background noise. Speech collected using TM is perceptually less natural. The objective of this paper is to map the spectral features (represented in the form of cepstral features) of TM and close speaking microphone (CSM) speech to improve the former’s perceptual quality, and to represe...

متن کامل

Wideband coding of speech using neural network gain adaptation

In this paper, a high-quality wideband speech coder is proposed. The coding structure resembles a LD-CELP coder, however, several novel improvements are made. The gain adapter for the stochastic codebook is driven by a neural network and it updates the excitation gain in a sample-by-sample fashion. The purpose of incorporating a neural network is to exploit both the intraand inter-frame correla...

متن کامل

Predictive vector quantization using the M-algorithm for distributed speech recognition

In this paper we present a predictive vector quantizer for distributed speech recognition that makes use of a delayed decision coding scheme, performing the optimal codeword searching by means of the M-algorithm. In single-path predictive vector quantization coders, each frame is coded with the closest codeword to the prediction error. However, prediction errors and quantization errors of futur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 2  شماره 

صفحات  -

تاریخ انتشار 1994